[UNDERTOW-2655] Fix text corruption in FileUtils.readFile when reading multi-byte characters #1834

finalchild · 2025-10-28T15:13:08Z

Summary

Fixes text corruption in FileUtils.readFile when reading multi-byte UTF-8 characters.

Problem: The original implementation read the InputStream into a fixed-size byte buffer (1024 bytes) and decoded each chunk independently. When a multi-byte character sequence was split across a buffer boundary, the decoder received incomplete character data, resulting in replacement characters (�) in the final string.

Solution: Replaced BufferedInputStream with InputStreamReader to handle buffering and character decoding together in a streaming fashion, ensuring multi-byte character sequences are never split.

Note: The implementation is copied from Java 25's InputStreamReader#readAllAsString.

This issue became more significant after fixing UNDERTOW-2337, as large form-data field values are now processed by this vulnerable function. Originally reported in Spring Framework issue #35292.

Issue: UNDERTOW-2655

fl4via

Hi @finalchild ! thanks for your PR, can you please create a test for the fix?

finalchild · 2025-10-31T03:28:26Z

@fl4via
Added tests!

…g multi-byte characters The readFile method was reading the InputStream into a fixed-size byte buffer and decoding each chunk independently. This caused multi-byte UTF-8 character sequences to be split across buffer boundaries, resulting in text corruption with replacement characters. Replaced BufferedInputStream with InputStreamReader to handle buffering and character decoding together in a streaming fashion, ensuring multi-byte character sequences are never split. This issue became more significant after UNDERTOW-2337, as large form-data field values are now processed by this function. Originally reported in Spring Framework issue #35292. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]>

fl4via added the bug fix Contains bug fix(es) label Oct 28, 2025

fl4via requested changes Oct 29, 2025

View reviewed changes

fl4via added the waiting PR update Awaiting PR update(s) from contributor before merging label Oct 29, 2025

finalchild force-pushed the UNDERTOW-2655 branch from 3e2adaf to 28f3eeb Compare October 31, 2025 03:27

finalchild force-pushed the UNDERTOW-2655 branch from 28f3eeb to b12583b Compare October 31, 2025 03:30

finalchild requested a review from fl4via November 2, 2025 04:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[UNDERTOW-2655] Fix text corruption in FileUtils.readFile when reading multi-byte characters #1834

[UNDERTOW-2655] Fix text corruption in FileUtils.readFile when reading multi-byte characters #1834

Uh oh!

finalchild commented Oct 28, 2025 •

edited

Loading

Uh oh!

fl4via left a comment

Uh oh!

finalchild commented Oct 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[UNDERTOW-2655] Fix text corruption in FileUtils.readFile when reading multi-byte characters #1834

Are you sure you want to change the base?

[UNDERTOW-2655] Fix text corruption in FileUtils.readFile when reading multi-byte characters #1834

Uh oh!

Conversation

finalchild commented Oct 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Uh oh!

fl4via left a comment

Choose a reason for hiding this comment

Uh oh!

finalchild commented Oct 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

finalchild commented Oct 28, 2025 •

edited

Loading